{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# COMPSCI 389: Homework 3\n", "\n", "**Assigned**: March 26, 2024. **Due**: April 2, 2024 at 2:00pm Eastern. **Note**: Submissions received after 2:00pm Eastern on April 9, 2024 will receive no credit.\n", "\n", "**Submitting**: Upload your submission on Gradescope as a `.pdf`. Converting to a PDF can be a complicated process, and so we encourage you to test this process well in advance of the submission deadlines. We recommend converting to HTML, opening the HTML file in a browser, and then printing or exporting to a PDF from your browser. We do not recommend directly converting to a PDF, since this requires installing xelatex. To convert to HTML in VSCode, press `ctrl+shift+p` and type `export`, and you should see an option to export to HTML.\n", "\n", "**Note**: Keep your `.ipynb` file, as we may request it directly (via email).\n", "\n", "**Note**: When converting to a PDF file, ensure that all of your code cells have been executed. The results of these executions *must* be included in your submitted PDF." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Instructions\n", "\n", "Complete the questions below, replacing the blue text with your own answers (your answers do not need to remain in blue). Do **not** modify the green text. Try to answer the questions without consulting your notes or any online material. If you cannot, then consult your notes, and if absolutely necessary, consult course materials (slides, notebooks) and/or Wikipedia. Do **not** use other sources or tools like ChatGPT. Complete this part of the assignment on your own (do **not** work with others).\n", "\n", "After you have completed all of the questions, at the bottom of this assignment you will find a link to another notebook, `Homework 3 Solutions.ipynb`. This contains the solutions, and instructions for ensuring that your answers are correct and sufficient. 
Make another pass through your homework assignment, replacing the green text with descriptions of what you missed for each question, and providing the fixes necessary to make your answer correct. **The solutions file may include additional instructions, which may include additional content to respond to even if you got a question correct (e.g., additional reflection).** During this second stage where you are filling in your answers, replacing the green text, you may reference the solutions, work with others, and use any tools (including ChatGPT).\n", "\n", "You will only submit this assignment once after replacing both the blue and green text. You do not need to submit the assignment between the first and second passes. Grading for each question will be based on whether you followed this process, arrived at the correct answers, and have sufficient discussion/text in the end. Points will be deducted if you did not make a reasonable effort to answer the question initially, if your final answer remains incorrect, or if your answers were not sufficiently clear (so, write in full sentences with proper punctuation, and convey your arguments clearly). Other than verifying that you made a reasonable initial effort for your initial answers (blue), points will **not** be deducted due to *initial* answers being incorrect. Hence, there is no reason to break the rules to obtain correct answers initially." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Part 1: Short Answer\n", "\n", "Answer the following questions with at least a few sentences, and no more than roughly one page of text." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 1. [10 points] Consider the plot created by the Python code below. How many local minima does this function have? How many global minima? 
Roughly where are they? (Rough visual estimates from the plot are sufficient.)\n", "\n", "Note: The function is\n", "$$\n", "f(x)=x^6 - 15x^5 + 85x^4 - 225x^3 + 274x^2 - 130x.\n", "$$" ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "data": { "image/png": "", "text/plain": [ "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "import matplotlib.pyplot as plt\n", "import numpy as np\n", "\n", "# Define the 6th order polynomial function\n", "def polynomial(x):\n", " # Example of a 6th order polynomial\n", " return x**6 - 15*x**5 + 85*x**4 - 225*x**3 + 274*x**2 - 130*x\n", "\n", "# Generate a range of x values\n", "x = np.linspace(-0.2, 5.4, 400)\n", "\n", "# Compute y values\n", "y = polynomial(x)\n", "\n", "# Plotting the function\n", "plt.plot(x, y)\n", "plt.xlabel('x')\n", "plt.ylabel('f(x)')\n", "plt.grid(True)\n", "plt.show()\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 2. [10 points] Is the use of one hot encodings more important for nominal or ordinal features? Why?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 3. [10 points] Recall that in lecture we ran gradient descent on the least squares loss for the GPA data set (as a regression problem). We observed that we had to use extremely small step sizes for gradient descent to not cause the model parameters to diverge. Why was this? How did we fix this?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 4. [10 points] You are working on a machine learning project to predict the sale prices of houses in a city. 
The data set includes features such as the number of bedrooms, number of bathrooms, square footage, and the age of the house. However, you notice that many entries in the 'number of bathrooms' column are missing.\n", "\n", "A) Explain the potential bias that could result from simply discarding rows with missing 'number of bathrooms' data when predicting house prices.\n", "\n", "B) As an alternative to discarding these rows, propose a method to handle these missing values (you may describe one that we covered in lecture).\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 5. [5 points] Consider the artificial neural network architecture described by the following code. How many hidden layers does this network have?\n", "\n", "```\n", "import torch.nn as nn\n", "import torch.nn.functional as F\n", "\n", "class Question4Network(nn.Module):\n", "    def __init__(self, num_classes):\n", "        super().__init__()\n", "\n", "        hidden_size = 10  # must match the output width of the preceding layer\n", "        self.fc0 = nn.Linear(9, 10)\n", "        self.fc1 = nn.Linear(hidden_size, 10)\n", "        self.fc2 = nn.Linear(hidden_size, 10)\n", "        self.fc3 = nn.Linear(hidden_size, 1)\n", "\n", "    def forward(self, x):\n", "        out = F.relu(self.fc0(x))\n", "        out = F.relu(self.fc1(out))\n", "        out = F.relu(self.fc2(out))\n", "        out = self.fc3(out)\n", "        return out\n", "```\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 6. [10 points] Consider the network from the previous question, but with the ReLU activation function removed (that is, no activation function). 
Why would this not be a reasonable parametric model to use?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 7. [5 points] What values are computed during the forwards pass in reverse mode automatic differentiation (the form of automatic differentiation that we focussed on in lecture)?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 8. [10 points] What values are computed during the backwards pass in reverse mode automatic differentiation?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 9. [5 points] Consider an expression, $f(x)$, that includes a $\\sin(z)$ term (where $z$ may depend on $x$). In order to implement reverse mode automatic differentiation, what derivative do we need to work out? What values do we assume are known?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 10. 
[10 points] Solve the derivative described in Question 9.\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 11. [10 points] In all of the network architectures that we discussed in lecture for computer vision tasks (tasks with images as input), convolutional layers were included as the first few layers of the network. Why are convolutional layers used at the start of the network (early hidden layers) rather than near the end of the network (later hidden layers, after some fully connected layers)?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### 12. [5 points] What is the purpose of max pooling layers in convolutional neural networks?\n", "\n", "***Initial Answer***\n", "\n", "Replace this text with your answer.\n", "\n", "---\n", "\n", "***Updated Answer***\n", "\n", "Replace this text with your response to the solution document.\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The solutions can be found here: [https://people.cs.umass.edu/~pthomas/courses/COMPSCI_389_Spring2024/Homework%203%20Solutions.ipynb](https://people.cs.umass.edu/~pthomas/courses/COMPSCI_389_Spring2024/Homework%203%20Solutions.ipynb)." ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.11.7" } }, "nbformat": 4, "nbformat_minor": 2 }